Data Structures for Range-Aggregate Extent Queries
نویسندگان
چکیده
A fundamental and well-studied problem in computational geometry is range searching, where the goal is to preprocess a set, S, of geometric objects (e.g., points in the plane) so that the subset S ′ ⊆ S that is contained in a query range (e.g., an axes-parallel rectangle) can be reported efficiently. However, in many situations, what is of interest is to generate a more informative “summary” of the output, obtained by applying a suitable aggregation function on S ′. Examples of such aggregation functions include count, sum, min, max, mean, median, mode, and top-k that are usually computed on a set of weights defined suitably on the objects. Such range-aggregate query problems have been the subject of much recent research in both the database and the computational geometry communities. In this paper, we further generalize this line of work by considering aggregation functions on point-sets that measure the extent or “spread” of the objects in the retrieved set S ′. The functions considered here include closest A preliminary version of this paper appeared in the Proceedings of the 20th Canadian Conference on Computational Geometry, Montreal, Aug. 13–15, 2008, pp. 7–10. Corresponding author Email addresses: prosenjit [email protected] (Prosenjit Gupta), [email protected] (Ravi Janardan), [email protected] (Yokesh Kumar), [email protected] (Michiel Smid) Research supported in part by grants SR/S3/EECE/22/2004 and DST/INT/US/NSFRPO-0155/04 from the Department of Science and Technology, Government of India. Research supported, in part, by the National Science Foundation under grants INT– 0422775 and CCF-0514950. Research supported by NSERC. Preprint submitted to Elsevier July 27, 2009 pair, diameter, and width. The challenge here is that these aggregation functions (unlike, say, count) are not efficiently decomposable in the sense that the answer to S ′ cannot be inferred easily from answers to subsets that induce a partition of S ′. Nevertheless, we have been able to obtain spaceand query-time-efficient solutions to several such problems including: closest pair queries with axes-parallel rectangles on point sets in the plane and on random point-sets in R (d ≥ 2), closest pair queries with disks on random point-sets in the plane, diameter queries on point-sets in the plane, and guaranteedquality approximations for diameter and width queries in the plane. Our results are based on a combination of geometric techniques, including multilevel range trees, Voronoi Diagrams, Euclidean Minimum Spanning Trees, sparse representations of candidate outputs, and proofs of (expected) upper bounds on the sizes of such representations.
منابع مشابه
Range-Aggregate Queries for Geometric Extent Problems
Let S be a set of n points in the plane. We present data structures that solve range-aggregate query problems on three geometric extent measure problems. Using these data structures, we can report, for any axis-parallel query rectangle Q, the area/perimeter of the convex hull, the width, and the radius of the smallest enclosing disk of the points in S ∩Q.
متن کاملارائه روشی پویا جهت پاسخ به پرسوجوهای پیوسته تجمّعی اقتضایی
Data Streams are infinite, fast, time-stamp data elements which are received explosively. Generally, these elements need to be processed in an online, real-time way. So, algorithms to process data streams and answer queries on these streams are mostly one-pass. The execution of such algorithms has some challenges such as memory limitation, scheduling, and accuracy of answers. They will be more ...
متن کاملAg-tree: an Index Structure for Range-aggregation Queries in Data Warehouse Environments
Range-aggregate queries are popular in many applications in data warehouse environments with large business relational databases. To evaluate these efficiently, several studies on data cubes (such as the aggregate cubetree) have been carried out. In the wellknown aggregate cubetree, each entry in every node stores the aggregate values of its corresponding subtree. Therefore, range-aggregate que...
متن کاملEfficient external memory structures for range-aggregate queries
We present external memory data structures for efficiently answering range-aggregate queries. The range-aggregate problem is defined as follows: Given a set of weighted points in R, compute the aggregate of the weights of the points that lie inside a d-dimensional orthogonal query rectangle. The aggregates we consider in this paper include COUNT, SUM, and MAX. First, we develop a structure for ...
متن کاملAccuracy Vs Lifetime: Linear Sketches for Appoximate Aggregate Range Queries in Sensor Networks
Query processing in sensor networks is critical for several sensor based monitoring applications and poses several challenging research problems. The in–network aggregation paradigm in sensor networks provides a versatile approach for evaluating simple aggregate queries, in which an aggregation–tree is imposed on the sensor network that is rooted at the base–station and the data gets aggregated...
متن کاملAnswering Approximate Range Aggregate Queries on OLAP Data Cubes with Probabilistic Guarantees
Approximate range aggregate queries are one of the most frequent and useful kinds of queries for Decision Support Systems (DSS). Traditionally, sampling-based techniques have been proposed to tackle this problem. However, its effectiveness will degrade when the underlying data distribution is skewed. Another approach based on the outlier management can limit the effect of data skew but fails to...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Comput. Geom.
دوره 47 شماره
صفحات -
تاریخ انتشار 2008